Journal: Emerging Infectious Diseases
Article Title: Disease-Associated Streptococcus pneumoniae Genetic Variation
doi: 10.3201/eid3001.221927
Figure Lengend Snippet: Resubstitution estimate and cross-validation results based on random forest models used in study of disease-associated Streptococcus pneumoniae genetic variation*
Article Snippet: For the additional validation analysis that used the best RF classifier k-mer (in srtG1 ), 2 independent datasets of S. pneumoniae genomes with genotype distribution similar to that of our study were available on the National Center for Biotechnology Information Assembly database ( https://www.ncbi.nlm.nih.gov/assembly (data1: 60 noninvasive vs. 60 carriage isolates; data2: 60 invasive versus 60 carriage isolates; the prevalence of the predominant GPSCs [GPSC1 and GPSC321] was 58.3% for noninvasive, 55.0% for invasive and 30.0% for carriage isolates) ( Table 4).
Techniques: